Sample Efficient Reinforcement Learning With Domain Randomization for Automated Demand Response in Low-Voltage Grids

نویسندگان

چکیده

Automateddemand response programs are being increasingly used to address voltage and congestion issues on the low-voltage distributed grid due rapid proliferation of energy resources demand electrification. Data-driven methods, such as reinforcement learning (RL), can help realize these solutions in practice. However, algorithms have their own limitations, including high sample complexity limited capability generalize nonstationary settings. In this article, using actual data from residential buildings distribution grids Belgium The Netherlands, we investigate limits state-of-the-art RL-based controllers both centralized decentralized We also show that it is possible considerably improve performance RL by making use domain randomisation transfer learning. With proposed method, not necessary a fidelity simulation system under consideration demonstrate considering varying degrees misspecification. Our results technique improves naively posed even when model misspecified, helps minimize violations losses, while avoiding need for costly exploration. communication single-point-of-failure issues.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sample Efficient Reinforcement Learning with Gaussian Processes

This paper derives sample complexity results for using Gaussian Processes (GPs) in both modelbased and model-free reinforcement learning (RL). We show that GPs are KWIK learnable, proving for the first time that a model-based RL approach using GPs, GP-Rmax, is sample efficient (PAC-MDP). However, we then show that previous approaches to model-free RL using GPs take an exponential number of step...

متن کامل

Integration in Low - Voltage Feeders with Demand Response

The rapid growth of photovoltaics (PV) over the last few years has introduced many new challenges for power system operators. Most PV is integrated into the low-voltage network, where they can cause voltage and line constraint violations if generation exceeds local demand. This thesis explores the capabilities of using demand response to aid the integration of photovoltaic capacity into low vol...

متن کامل

Sample-Efficient Evolutionary Function Approximation for Reinforcement Learning

Reinforcement learning problems are commonly tackled with temporal difference methods, which attempt to estimate the agent’s optimal value function. In most real-world problems, learning this value function requires a function approximator, which maps state-action pairs to values via a concise, parameterized function. In practice, the success of function approximators depends on the ability of ...

متن کامل

Sample-efficient Deep Reinforcement Learning for Dialog Control

Representing a dialog policy as a recurrent neural network (RNN) is attractive because it handles partial observability, infers a latent representation of state, and can be optimized with supervised learning (SL) or reinforcement learning (RL). For RL, a policy gradient approach is natural, but is sample inefficient. In this paper, we present 3 methods for reducing the number of dialogs require...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE journal of emerging and selected topics in industrial electronics

سال: 2022

ISSN: ['2687-9743', '2687-9735']

DOI: https://doi.org/10.1109/jestie.2021.3117119